PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopim11g011940.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family HD-ZIP
Protein Properties Length: 718aa    MW: 81802.7 Da    PI: 6.7839
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopim11g011940.0.1genomeCSHLView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox53.93.1e-172677556
                        SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
            Homeobox  5 ttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                        +++t eq+++Le++F+++++p++++r +L ++ gL+ +q+k+WFqN+R++ k
  Sopim11g011940.0.1 26 HRHTMEQIQRLEAFFKECPHPDENQRNQLGREAGLDPKQIKFWFQNKRTQTK 77
                        46899********************************************988 PP

2START104.22.3e-332394583205
                         HHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS........SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....E CS
               START   3 aeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.......dsgealrasgvvdmvlallveellddkeqWdetla....k 78 
                           + ++e + + + ++p Wv ss     s  ++ + ++f+   +        ++e ++++gvv m++ +l  ++ld   +W + ++    k
  Sopim11g011940.0.1 239 VVASMNEMFELLQMNDPIWVDSSsdggCSIHRESYERIFSNM-NrpyksatARIESSKDCGVVSMPANELIHSFLDPV-KWINLFPtivtK 327
                         567899*****************8885222233333333332.257889999**************************.99888888888* PP

                         EEEEEEECTT...EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--..-TTSEE-EESSEEEEEEEECTCEEEE CS
               START  79 aetlevissg...galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe.sssvvRaellpSgiliepksnghskv 164
                         a t+ev++sg   g +qlm+ +l  lsplv  R+f f+Ry+rq    +w+ vdvS d  ++ ++  +s+    ++pSg+ i++++n  s v
  Sopim11g011940.0.1 328 ARTIEVLDSGtlgGSVQLMYEKLHILSPLVEaREFFFIRYCRQIDPTTWIMVDVSYDLFNEIQSgVPSYSW--KFPSGCAIQDMGNDQSMV 416
                         ******************************99*********************999998887766666655..9***************** PP

                         EEEE-EE--SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
               START 165 twvehvdlkgrlp.hwllrslvksglaegaktwvatlqrqce 205
                         twvehv +++++    ++r l+  ++  gak+w  +lqr  e
  Sopim11g011940.0.1 417 TWVEHVQVNEKSQvNHIFRDLLCDRQTYGAKRWIVALQRMSE 458
                         ********998766************************9877 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.604.0E-191073IPR009057Homeodomain-like
SuperFamilySSF466893.81E-171080IPR009057Homeodomain-like
PROSITE profilePS5007116.2611979IPR001356Homeobox domain
SMARTSM003894.0E-161983IPR001356Homeobox domain
CDDcd000861.20E-142080No hitNo description
PfamPF000466.3E-152677IPR001356Homeobox domain
PROSITE profilePS5084832.101228462IPR002913START domain
SuperFamilySSF559611.51E-25232460No hitNo description
CDDcd088752.53E-74236458No hitNo description
SMARTSM002342.8E-16237459IPR002913START domain
PfamPF018526.9E-27240458IPR002913START domain
Gene3DG3DSA:3.30.530.209.2E-8240424IPR023393START-like domain
ProDomPD0233776.0E-4271402IPR005266Uncharacterised protein family UPF0128
SuperFamilySSF559619.61E-8481676No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 718 aa     Download sequence    Send to blast
MTDSGEEHIG ESSNSQKKSK RQRQCHRHTM EQIQRLEAFF KECPHPDENQ RNQLGREAGL  60
DPKQIKFWFQ NKRTQTKTQN ERSDNNALRM ENERFLCENM AMKESMKNIM CPKCDGPPIG  120
KEERARNLEN MKLENQRLRE QHEKASNFLS SILGRSFVMG SNLAPPKSTL QTSSNSSDES  180
LLSQNICGSP IRYPPQENNN NVRAHSININ NIPIMSPSRQ EHYEFHHDNR QRTDTFEIVV  240
ASMNEMFELL QMNDPIWVDS SSDGGCSIHR ESYERIFSNM NRPYKSATAR IESSKDCGVV  300
SMPANELIHS FLDPVKWINL FPTIVTKART IEVLDSGTLG GSVQLMYEKL HILSPLVEAR  360
EFFFIRYCRQ IDPTTWIMVD VSYDLFNEIQ SGVPSYSWKF PSGCAIQDMG NDQSMVTWVE  420
HVQVNEKSQV NHIFRDLLCD RQTYGAKRWI VALQRMSERY NFTMGATCPT RHDFEGVFND  480
PEGLKNTIQL SQRMVKNFFE ILSMTDKLDF PASPQLSSGN RISIRKNEEI TQTKGFIATA  540
SSSLWIPLSF QDVFNFFKDN KTRSKWDILT GGLKMTELAR VSTGTFPENC ITIIQSYLQM  600
EKLVLQESSI DEMGAFLIFA PLELPTMTSI FNGHDATKVP ILPSGIIISP DGRLVSDRGN  660
TENAQNGSIL TVTFQILICD NNNISISQQQ HMEVVNSIHS LLRTTVSKIK AALGCSN*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
11722KSKRQR
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9755230.0HG975523.1 Solanum lycopersicum chromosome ch11, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_004250550.10.0PREDICTED: homeobox-leucine zipper protein ROC8-like
TrEMBLK4D5Z90.0K4D5Z9_SOLLC; Uncharacterized protein
STRINGSolyc11g011940.1.10.0(Solanum lycopersicum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA10211732
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G73360.11e-160homeodomain GLABROUS 11